Improved Spatio-temporal Salient Feature Detection for Action Recognition

نویسندگان

  • Amir Hossein Shabani
  • David A. Clausi
  • John S. Zelek
چکیده

Spatio-temporal salient features localize the local motion events and are used to represent video sequences for many computer vision tasks such as action recognition. The robust detection of these features under geometric variations such as affine transformation and view/scale changes is however an open problem. Existing methods use the same filter for both time and space and hence, perform an isotropic temporal filtering. The contribution of this paper is on improving the temporal filtering for robust salient feature detection (Fig. 1) and consequently, improving the action recognition accuracy. We thus develop a novel biologicallyconsistent multi-resolution filtering which is asymmetric and causal in the temporal direction. This development is based on an anisotropic diffusion to the past frames. Given the functional nature of this filter, we refer to it as the "asymmetric sinc". We also introduce the use of Poisson and truncated exponential filters. We then compare the complex form of these three asymmetric filters with the standard symmetric Gabor filtering (Fig. 2). The performance of these filters are evaluated by answering three research questions: (1) whether asymmetric temporal filtering is better than symmetric Gabor filtering for salient motion feature detection, (2) which filter is the best, and (3) the features detected by which filter provide higher action classification accuracy. We use a common framework (Fig. 1) for spatio-temporal salient feature detection to investigate the role of differen temporal multi-resolution filters. To be consistent with the biological vision, we compute a phase and contrast polarity insensitive [2] motion map. We preformed three experimental tests of precision, reproducibility, and action classification to evaluate the salient features. The precision score determines the ratio of the true positive features over all detected features. The reproducibility score determines the percentage of re-detected salient features under different geometric deformations such as scale/view change or affine transformation. To provide quantitative results for the precision and reproducibility tests, we used the Weizmann classification and robustness datasets for which the ground truth (i.e., foreground masks) is provided. For the action classification test, we used the KTH and the UCF sports datasets as well. A discriminative bottom-up approach based on bag-of-words (BOW) representation of the video content is used for the action classification. Our observation (Fig. 3) is that the asymmetric temporal filters perform better than the symmetric Gabor filtering in detecting precise, robust, and informative salient features. Among all, our asymmetric sinc filtering performs the best. Without video segmentation, our 95.5% accuracy on the Weizmann dataset and 93.3% accuracy on the KTH dataset competes with the state-of-the-art methods which use similar discriminative BOW setting. Moreover, our 91.5% accuracy on the UCF sports dataset is much better than the dense trajectory [4] with 88.2%, the unsupervised feature learning [1] with 86.5%, and the dense sampling [3] with 85.6% accuracy. This is in contrast to the previous observations about better performance of dense sampling/trajectories, showing the importance of proper temporal filtering for robust and informative salient feature detection.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Study of Human Action Recognition Based on Improved Spatio-temporal Features

Most of the existed action recognition methods mainly utilize spatio-temporal descriptors of single interest point ignoring their potential integral information, such as spatial distribution information. By combining local spatio-temporal feature and global positional distribution information (PDI) of interest points,a novel motion descriptor is proposed in this paper. The proposed method detec...

متن کامل

Adaptive Tuboid Shapes for Action Recognition

Encoding local motion information using spatio-temporal features is a common approach in action recognition methods. These features are based on the information content inside subregions extracted at locations of interest in a video. In this paper, we propose a conceptually different approach to video feature extraction. We adopt an entropybased saliency framework and develop a method for estim...

متن کامل

Recognition of Visual Events using Spatio-Temporal Information of the Video Signal

Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...

متن کامل

Human action categorization using discriminative local spatio-temporal feature weighting

New methods based on local spatio-temporal features have exhibited significant performance in action recognition. In these methods, feature selection plays an important role to achieve a superior performance. Actions are represented by local spatio-temporal features extracted from action videos. Action representations are then classified by applying a classifier (such as k-nearest neighbor or S...

متن کامل

Spatio-temporal layout of human actions for improved bag-of-words action detection

5 We investigate how human action recognition can be improved by considering spatio-temporal layout of 6 actions. From literature, we adopt a pipeline consisting of STIP features, a random forest to quantize the 7 features into histograms, and an SVM classifier. Our goal is to detect 48 human actions, ranging from simple 8 actions such as walk to complex actions such as exchange. Our contributi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011